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Abstract 

We construct an infinitely exchangeable process on the set £ of subsets of the power set of the 
natural numbers N via a Poisson point process with mean measure A on the power set of N. Each 
E e £ has a least monotone cover in J 7 , the collection of monotone subsets of £ , and every monotone 
subset maps to an undirected graph G G Q, the space of undirected graphs with vertex set N. We 
show a natural mapping £ — > J 7 — > Q which induces an infinitely exchangeable measure on the 
projective system g rest of graphs Q under permutation and restriction mappings given an infinitely 
exchangeable family of measures on the projective system £ rest of subsets with permutation and 
restriction maps. We show potential connections of this process to applications in cluster analysis, 
machine learning, classification and Bayesian inference. 



1 Introduction 

Here, we show a construction of an infinitely exchangeable family of random graphs which is based on an 
associated Poisson point process on the power set of the natural numbers N. We provide a necessary and 
sufficient condition for the induced random graph to be infinitely exchangeable and discuss a potential 
use for this model in the area of cluster analysis and stochastic classification, which has been previously 
studied in a statistical and machine learning context in previous work by Jordan [2] (with Blei and Ng) 
and [7] (with Broderick and Pitman) McCullagh \10\ 111], but outside of the realm of network analysis. 

We now introduce preliminary material and notation which is critical to our treatment. 



1.1 Projective systems 

A projective system associates with each finite set [n] a set Q n and with each one-to-one injective map 
(p : [m] — > [n], m < n, a projection ip* : Q n —¥ Q m which maps Q n into Q m such that 

• if ip is the identity [n] — > [n] then ip* is the identity Q n — > Q n and 

• if ip : [I] — > [m], I < m, and ip* : Q m — > Qi is its associated projection, then the composition 
(ipip) : [I] — > [n] satisfies (ipip)* = ip*(f* : Q n -> Qi- 

If Q n is the set of subsets of [n] 2 , i.e. the space of directed graphs with n vertices, one can define the 
projection Q n — > Q m either by restriction or delete-and -repair. Each A. G Qn can be represented as an 



i 



n x n matrix with entries in {0, 1} such that Aj = 1 if G A and Aj = otherwise. For each n > 1, 
let (£> n ,n+i be the operation on which restricts A to the complement of {n + 1}. In matrix form, 

Pn,n+iA =: -A|[ n ] is the nxn matrix obtained from A by removing the last row and last column of A and 
keeping the rest of the entries unchanged. It is clear that the compositions tp m ,n '■= <Pm,m+i ° ■ ■ " °¥>n-i,ra 
for m < n are well-defined as the restriction of A G Q n to [m] by removing the last n — m rows and 
columns of A. 

For n > 1, we write S n to denote the symmetric group of permutations of [n], i.e. one-to-one maps 
[n] —> [n]. Each a £ S n acts on each element A G Q n componentwise in the usual way. That is, (i, j) G A 
if and only if (cr(i), cr(j)) G The restriction maps (ip m ,m m < n) together with permutation maps 

(a G <S„, n > 1) and their compositions make Q := {Qn-, n>\)& projective system. 

Another way to specify a projective system on (Q n ,n > 1) is by delete- and-repair. For n > m > 1, let 
ip m act on ^4 G Q n by removing the mth row and column of A and directing an edge from each i in 
{j G [n] : (j,m) G A} to each k in {j € [n] : (m,j) G ^4}. In other words, ^> m .A is obtained by deleting 
the vertex labeled m from A and connecting two vertices i and fc by a directed edge from i to k if both 
(i, m) and (m, A;) are elements of A, i.e. there is a directed path z — > m — >■ A; in A 

For m < n, define := Y'm+i ° " " " ° "0n- Plainly, ^> m n is well-defined since for each n > 2, 

V>n-2,ra = ^n-i ipn = VVi ° VVi-i and W = V'i.m ip m ,n- The delete-and-repair maps (ipm,n,m < n) 
together with permutation maps (cr € 5 n ,n > 1) and compositions also make (Q n ,n > 1) a projective 
system, which differs from the above projective system based on restriction maps. 

Throughout the rest of this paper, for a system Q := (Q n ,n > 1) we write ipm,n to represent the 
restriction maps for collection Q and t^m,n to represent the corresponding delete-and-repair maps. We 
write Q rest , resp. Q dr , to denote the projective system on Q described by the restriction, resp. delete- 
and-repair, maps together with permutation maps. 

1.2 Power set of 2^ 

Let E n = 2 2 '™ 1 denote the power set of all subsets of 2^ n \ the power set of [n]. For each n > 1, we define 
the partial order < £ on £ n by 

E < £ E' -i=^ [eeE^ee E'\. 
Therefore, {{1}} < £ {{1},{1,2}} but {{1}} and {{1,2}} have no relation under < £ . 

We define restriction maps : £n — >• £m as follows. For E : = {ej : 1 < % < k} G £ n , we define 

(p^ n (E) := E\[ m ] := {e« (~l [m] : 1 < i < k}. The maps (tpm,n-> m — n ) Preserve < £ on £ since 

E < £ E' [ee£=>eefi'] 

and so 

e € E e n [m] G y4 >n £?, e D [m] G ¥&,n S ^£ 

Permutations act on elements of E componentwise, i.e. for n > 1,E G £ n , and <r G 5 n , we have 
a(E) := {cr(e) : e G E 1 }. We write £ rest to denote the projective system £ := (£ n ,n > 1) together with 
restriction maps {ip^^^m < n) and permutation maps (a G S n ,n > 1) and their compositions. 



1.3 Monotone sets 



A subset A C 2^ is monotone if a £ A implies 2 a C A. For example, the set A = {0, {1}, {2}, {3}, {1, 2}} = 
({1,2}, {3}) is a monotone set with maximal elements {1,2} and {3}, which constitute the generating 
class of A, written G(A). An element a of the generating class of a monotone set A is a maximal element 
of A in the sense that no other element a' £ A contains a as a subset. The generating class G(A) of 
a monotone subset A consists of all maximal elements of A. We write T n as the set of monotone sets 
taking elements in 2^. Note that a monotone set is uniquely determined by its generating class, and 
so we will write A G T n and G(A) to describe the same object, i.e. the monotone set A, whenever it 
is convenient to do so. Every subset E of 2^ has a least monotone cover in T n , which we denote by 
a(E) and is given by a(E) := {2 a : a G E}. 

For each n > 1, T n is a partially ordered set induced by the partial order inclusion on 2^ n \ i.e. for 
A,B£ T n , we say A <jr B if and only if each a G G(A) is a subset of some b G G(B). And for any pair 
A, B with A < B, the intervals [A, B], (A,B] and [A, B) are well-defined subsets of T n - Note that we 
intend the symbols C and C to have strictly different meanings in this paper. In particular, we write 
A C B to mean A is any subset of B, while we write A C B to mean A is a proper subset of B, i.e. 
A Q B but A ^ B. This distinction becomes important in the next section. 

We define the operation restriction if^ n+1 : T n +\ — > J~ n as the operation which maps G{A) \— > G(A) n 
[n\. That is, for A G T n+ i with G(A) := {Ai, . . . , A fc }, (p^ >n+1 A := (Ai n [n] : i = 1, . . . , k). For m < n, 
we define <^ n := <^ m+1 o ■ ■ -ocpF^ n in the usual way by composition, and the collection of restriction 
maps T := (<^^ n ,m < n) together with permutation maps makes (J- n ,n > 1) into a projective system, 
written jr rest _ Here, a permutation a 6 S n acts on a monotone set ^4 G T n by acting componentwise 
on its generating class, i.e. a G G(A) if and only if o~(a) G G(ct(j4)). 

Also note that the inverse mapping (p^n+i associates with each F G Tn an interval, and also maps 
intervals in Tn to intervals in Tn+i- in particular, for n > 1 and A := (Ai, . . . , A^) G J^, we have 

= [<G(A)>, U {n + 1} : j = 1, . . . , k)\ . 

1.4 Undirected graphs 

For n > 1, an undirected graph G G Q n is a pair (V, -E 1 ) of vertices 1/ and edges E whereby the number 
of vertices #V of G is n, and without loss of generality we assume V = [n] := {1, . . . , n}, and the edges 
are a subset of [n] 2 such that G implies (j, i) G -E 1 . 

For each n > 1, the elements of correspond to the symmetric subsets of [n] 2 , e.g. for A G G n , 
(i,j) G A if and only if G A, and hence Q := (Gn, n > 1) is a projective system under both 

restriction and delete-and-repair, as described in section [LT1 For each n > 1, C/ n is a poset under <g 
where for G, G' G £ n 

G< g G'^[{y}eG^{i,i}eG']. 

For clarity of notation, write n and n to denote the operations of restriction and delete-and-repair 
respectively on the projective system £ rest and Q dv respectively. 



1.5 Some category theory 



The relationship between the collections £, F, and Q is described in a straightforward way by elementary 
concepts in category theory [T]. 

A category C consists of objects A,B,C, . . . and arrows f,g,h,... between objects so that for each 
arrow / £ C there are objects dom(/) and cod(/) in C, the domain and codomain respectively of 
/, and we write / : dom(/) — > cod(/). Given /,j 6 C such that cod(/) = dom(g), the composite 
g o f : dom(/) — > cod(g) is an arrow in C. Also, for each object A £ C there is an identity arrow 
1a '■ A — > A, and all arrows of C must satisfy associativity and preservation under composition with 
the identity functions. 

In each of the categories we define below, there is at most one arrow between any two objects. Therefore, 
if an arrow / corresponds to the composition of arrows g o h, these must represent the same arrow. 
Under this assumption, for a pair of objects we need not make explicit all of the various compositions 
of arrows which result in an arrow between these objects, as they are implicitly assumed to be there, 
and are all the same arrow. 

We define three categories as follows. We write £ rest for the category with objects given by the elements 
of £ and arrows given by the restriction (c^ n ,m < n) and permutation (a £ S n ,n > 1) maps, and 
compositions of these maps. That is, for m < n and E £ £ n , there is an arrow E — > ip^^E = Eu m y We 
denote the arrow between E and -E|[ m ] by tp^^E. Likewise, we define jr rest and g rest to be the category 
with objects given by the elements of F and Q respectively, and arrows defined by the restriction maps, 
(ip- F . n ,m < n) and (ip^ nn ,m < n) resp., and permutation maps. For example, in J 71 " 68 *, we have 
({1,2}, {3}} ({1,2}). 

A functor between categories C and D is a map F : C — > D which takes objects in C to objects in D 
and arrows in C to arrows in D. There are natural functors a : £ rest — > jr rest anc l ft : jr rest _s. g rest ^ 
which we define as follows. 

(i) For each n > 1 and E £ £ n , a{E) £ T n is the least monotone cover of E. 

(ii) For each n > 1 and F := (F\, . . . ,Fk) £ F n , (3(F) = Gf £ Q n where £ Gf if and only if 
{hj} Fj f° r some j = 1, . . . , k. 

(iii) For m < n, a(</4,J = V?m,n and /%£,J = (/4,n so that (« ° £)(</4,J = 

(iv) For every n > 1, a(o~) = a and /3(a) = a for all a £ S n . 

Functors between posets preserve partial ordering. For • = £,F,Q, we have defined the partial order 
<» in previous sections. For m < n and A, A' £ •„ such that A <, A', it holds that ip m mn A <, Lp* mn A' . 
Hence, for each fixed 1 < m < n, the restriction maps (</7mn) define a functor « n — » « m , where we take 
the elements of •„ and « m as objects and the partial order <, defines the arrows, i.e. there is an arrow 
A ->• A' in •„ if and only if A <. A'. 

We can also regard the collections £ rest , jr rest ( ^rest ag p ar tially ordered sets with partial order <.rest, 
• = £, F, Q, defined as follows. For m < n, A £ « n , A' £ » m , 

A' < .rest A < > A — if m n A. 

The functors a,/3 and (3 o a above preserve the partial orders < £reB t, <jrr eB t and <Qr eB t. 



Below, we show a construction of infinitely exchangeable random elements of £ rest . In particular, we 
construct an infinitely exchangeable random monotone set by projecting from a Poisson point process 
X on the power set to a random subset X* £ £rest Q £ power set to the least monotone cover 
a(X*) £ J 71 " 65 *, which corresponds to a random graph /3oa(X*) £ Q rest . This procedure looks like this 

X ->* X* ^ a a(X*) ->^o a(X*). 



Projective systems in statistics The relevance of category theory and projective systems in statis- 
tical modeling is discussed in detail by McCullagh [9j. Here we have introduced projective systems for 
the collection of subsets of 2^ , their associated monotone subsets of 2^ and their associated undirected 
graphs in [n] 2 . 

The choice of projection, i.e. either restriction or delete-and-repair, on Q is intended to reflect the notion 
of subsampling in statistics, and each admits its own statistical interpretation in terms of subsampling 
which may be appropriate depending on the application, and the actual way in which observations are 
made. In the study of directed graphs, in particular permutations, delete-and-repair is often used, see 
e.g. the Chinese restaurant process (CRP) on permutations |12j . However, for our purposes, restriction 
maps are a natural choice. 

To be specific, if we think of a graph as modeling a social network, the edges of the graph represent 
social links among the agents (nodes) in the population. When observing a social network, there 
are several sampling methods which are reasonable. The most intuitive of these are perhaps node 
sampling and snowball sampling. In node sampling, we first sample a set {ui, . . . ,u n } of nodes and 
subsequently observe any edges between these nodes. In this setting, restriction accurately reflects our 
sampling method since removal of a node from our sample, e.g. u n , removes any edges which involves 
that node from our sampled graph. In snowball sampling, we start with a set of nodes, without 
loss of generality suppose we start with one node u, and proceed as follows. Given Uq = u, sample 
U\ := {v £ U : v ~ Uq}, the set of nodes v adjacent to some node in Uq in the graph we are observing. 
Subsequently, put Uk+i '■= {v £ U : v ~ Uk, v £ U)., Uk-i, ■ ■ ■ , Uo}. In snowball sampling, we stop 
at some arbitrary level k. This way of sampling results in a sample of all nodes at radius k or less 
from u, and depends on our choice of u and the network structure. In this case, removal of a node 
v £ V := IJi=o Ui results in removal of all other nodes w G V such that v lies along every path between 
u and w. The result of this type of subsampling is not described simply via restriction. Below, we 
discuss only those projective systems characterized by the restriction maps, and the implications of this 
method of sampling on inference. 



1.6 Infinite exchangeability 

A family (p n ,n > 1) of probability measures on a projective system Q := (Q n ,n > 1) is infinitely 
exchangeable if it is invariant under both the action of permutations, called finite exchangeability, and 
selection according to the projection maps (D mn , m < n) associated with the system, called consistency. 
For example, a family of measures (p n ,n > 1) on Q is infinitely exchangeable if 

• for each n > 1 and a £ <S n , p n (A) = p n (o~(A)) for every A £ Q n and 

• for every m < n, p m (A) = p n (D^ n (A)) for every A £ Q m . 



An infinitely exchangeable collection of measures uniquely characterizes a measure p on the infinite 
space associated with Q through its finite-dimensional distributions and invariance under projection 
and permutation maps. 

Above we have defined functors a : £ rest — > jr rest anc | p : j-rest _^ grest Q ng p r0 p er ty Q f functors is 
that a(E\ o £? 2 ) = a(-Ei) o a{E2). Hence, we have the following elementary lemma which is useful for 
our purposes below. 

Lemma 1.1. Let 7 : C — > D be a functor between categories C and D where the objects C of C and D 
of D /orm a projective system under mappings ir c and 7r D respectively, and the arrows are defined by 
the partial ordering induced by the projections ir c and 7r D . Let /i c be a probability measure on C and 
let /i D = ^c7 _1 be the distribution induced on the objects o/D by fi c through the functor 7. If fj, c is 
invariant under tt c , i.e. fj, c = /t/ C 7r c , then ^ D is invariant under 7r D . 

Proof. Since 7 : C — > D is a functor, we have 7(5 o /) = 7(g) o 7(7) for any arrows f,g € C such that 
dom(g) = cod(/). Let c be an object in C and c' := 7r c (c), so that we have an arrow c ^• 7r c( c ) d which 
we denote by 7r c (c), and the action of the functor 7 is such that 

( 7 o vr c )(c) = 7 (^ C (c)) = 7^(7^)) = (vr D o 7 )(c'), 

and we have 7 o tt c = 7r D o 7 and invariance of the induced measure /x D follows. Indeed, for d = 7(c) 
and d! = 7(0'), 

MD(d') = VcT\d>) = (^"h-Hd') = ^ c ( 7 o^ c )-V) = Aic^or)" 1 ^) = (^cT- 1 )^ 15 "^) = 

□ 

This will simplify our proofs below for infinite exchangeability of the random graph induced by a random 
subset of 2N. 



2 Construction of an infinitely exchangeable random graph 

Let X be a Poisson point process on 2N with mean measure A so that {X a :ae 2W} 

is a collection of 

independent Poisson random variables with each X a having mean A(a) > 0. By ignoring multiplicities, 
each realization of this process defines a random subset X* := {a C [n] : A a > 0} G £ n which consists 
of those points a G 2^ for which X a > 0. The distribution of X* is given by 

F(X* CE)= Yl exp{-A(a)}. 

aC[n]:a£E 

In general, X* will not be monotone, but, as discussed above, it will have a least monotone cover given 
by a(X*) := {2 a : a G A*} G Tn- 

It is straightforward to compute the induced distribution of a (A*) on T n under the partial ordering 



<jr. That is, for A £ T n , let A denote the complement of A in 2^ n \ then 

P„(a(X*) <jrA) = l[P(X a = 0) 



Y[ exp{-A(a)} 



= ex PS -^_ A («) > • 

[ a£A J 

For each n > 1, let 0^ and O^ 7 be the minimal element of f n and respectively, i.e. 0^ = 0^ = 0, 
and define /x n ([0^,A]) := P n (X* < £ A) = exp{— Y^ a <z[n]-aiA A «(°)} to be the probability measure on 
£ n and i/ n ([0^, 2?]) := ¥(a(X*) <jr B) = exp {— X^fceB A n (&)} the probability measure on T n induced 
by fi n through a, shown above, for some non-negative mean measure A n (-) on 2^. 

For a subset A C N, let ffA denote the cardinality of A, i.e. the number of elements in A. We now 
show that a necessary and sufficient condition for fj, n , and hence u n , to be infinitely exchangeable under 
action of {'pf nn ,m < n) is that for every n > 1, A n (a) = A n (#a) for some collection of non-negative 
real numbers (A n (r),n > 1,0 < r < n) which satisfy 

A n (r) = A n+ i(r) + A n+ i(r + l). (1) 

Theorem 2.1. The collection (fi n ,n > 1) of probability measures on £ rest and (v n ,n > 1) on j^ rest 

are infinitely exchangeable if and only if the collection of mean measures (A n ,n > 1) satisfy 

(a) for every n > 1, A n (a) = A n (#a) for all a € J- n for some collection of measures (A n (-),n > 1) on 
the non-negative real numbers and 

(b) for every n>l, A n (r) = A n+ i(r) + A n+ i(r + 1) for r = 0, 1, . . . , n. 

Moreover, if (a) and (b) hold, the measure > 1) induced on g rest by (fi n ,n > 1) through ft o a, 

i.e. /i* := fj, n a~ 1 f3~ 1 = f n /3 _1 , is infinitely exchangeable. 

Proof. Suppose (a) and (b) hold. Let n > 1 and suppose E G £ n . Then 



Mn([°n>£]) = ex P < 



iC\n]:e4E 



Clearly, the fj, n are finitely exchangeable for each n > 1 as the measure depends only on the cardinality 
of the elements of {e C [n] : e £ E}, which directly depends on the cardinality of the elements of E, 
which are invariant under permutations. 

As noted in section [LT] the pullback ^„,n under the restriction maps takes intervals to intervals. Hence, 
for E = {Ei, . . . ,Ef.} 6 £ m , the interval [0^, E] C £ m maps to the interval [0^, sup ip^ n(E)] C £ n , 
where sup (p^^E) = E U {m + 1, . . . , n} is the unique maximal element of (p^^E) in £ n . To simplify 
notation, write sup mn E := supip^^E) in what follows. 



Hence, for consistency under sampling by restriction maps we must have 



H n ([0 n ,E]) = fi n+1 ( [0 n+ i , sup nj „ +1 E) ) 



(2) 



Consistency follows from this since we now have 
Mn+l([°ra+l,sup TO) „ 

[eC[n+l]:e^sup roin B J 

which reduces (J2J) to 

^A„(#e)= W#eO (3) 

eC[n] e'C[n+l]:e'^sup min £; 

which is a sum over subsets of the power set of [n] and [n + 1] respectively. Under restriction, each 
e E 2^ corresponds to a two element subset of [n + 1], namely {e, e U {n + 1}}. Hence, for {e C [n] : 
e ^ -E} := {ei, . . . , e^}, we have {e' C [n + 1] : e' ^ sup m n -E} = {ey, U {re + 1} : j = 1, . . . , k} and 
Q is just 

£ A n (#e)= ^ [A„ +1 (#e) + A„ +1 (#e + l)]. 

eC[n]:e£E eC[n]:e§££ 
Infinite exchangeability for v n is endowed by \x n through a, as discussed in section [L6l 

For the reverse implication, note that we start with the condition on the mean measures A n 

A n (e)= [A„+i(e) +A n+ i(eU{n + l})] 

eC[n]:e£E eC[n]:e<£E 

and exchangeability requires A„(a) = A n (b) for all a,b C [n] with #a = #6 so we can reduce this to 
the collection of measures A n (-) taking values in the non-negative real numbers, as we have done above. 
And consistency requires of either /i n or u n requires (b) to hold. 

The infinite exchangeability of the measures (/i^,n > 1) on g rest is a direct corollary of the infinite 
exchangeability of /J, n and the action of the functor f3 o a : £ rest g rest □ 

Corollary 2.2. Suppose (X n (r),n > 1, r = 0, . . . , n) is a doubly indexed sequence of non-negative real 
numbers satisfying (p}, then there exists a measure n* on Q, the space of graphs with vertex set N such 
that 

ti(G) = ({g* e g : q w = g)) , 

where G?r n i denotes the restriction of G* to the vertex set [n] and /i* are the induced measures on Q n 
given above. 

We make note of a correspondence between the solutions to ([1]) and the classical Hausdorff moment 
problem. Connection between the Hausdorff moment problem and de Finetti's theorem have been 
shown by Diaconis and Freedman [8] and are well known throughout the literature. Some usable 
choices for (A n (r),0 < r < n) are 

• A n (r) oc a r (l — a) n ~ r for < a < 1 and 
. A n (r) oc (?)-\ 



3 Cluster analysis 



Given a measure /i* on Q n which is based on a collection (A n (r),n > 1, r = 0, 1, . . . , re) which satisfy 
dH), we can easily calculate the marginal distribution that a triple of vertices, e.g. i,j,k, is transitive, 
i.e. i ~ j,i ~ k,j ~ k, given two are adjacent, e.g. i ~ j and i ~ k, by a standard exchangeability 
argument. In particular, for any n > 3 and i,j, k £ [n] we have 

M;[{i~i,*~A:,J~*;}|{*~i,*~A:}] = ^[{1 ~ 2,1 ~ 3,2 ~ 3}|{1 ~ 2, 1 ~ 3}] 

l_ e -A 3 (3) (1 _ (1 _ e -A 3 (2) ) 3 ) 
l_ e -A 3 (3)( 1 _( 1 _ e -A 3 (2))2)' 

which, unlike the Erdos-Renyi process, does not correspond to the clustering coefficient. This calculation 
only accounts for the subgraph comprised of the vertices k and any edges between them. In general, 
if we observe a graph G € Q n and we are told that i ~ j and i ~ k but not told if there is an edge 
between j and k or not, but we also know the rest of the graph structure, this information is relevant 
for determining whether or not j ~ k in G, and has implications in the inference of missing links in 
network data sets. 

For example, consider the two graphs G\ and G2 below where 
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and the presence or absence of an edge between 1 and 2 is unknown, but the rest of this network is 
known. Given what we do know about G\, there are three possible monotone sets which correspond to 
Gi, 

({1, 2}, {1, 3}, {2, 3}, {3, 4}), ({1, 2, 3}, {3, 4}), ({1, 3}, {2, 3}, {3, 4}). 

On the other hand, there are five monotone sets corresponding to G2, 

({1,2, 4}, {1,2,3}), ({1,2,3}, {1,3}, {2,3}) ({1, 2, 3}, {1, 4}, {2, 4}) 

({1, 2}, {1, 4}, {2, 4}, {2, 3}, {1, 3}), ({1, 3}, {1, 4}, {2, 3}, {2, 4}). 

So the information given by the rest of the network, and any edges, or lack thereof, which involve any 
of the i, j, k of interest, affects the conditional probability of j ~ k, e.g. 1 ~ 2 in this case. 

In general, the clustering of this process is expected to be larger than the marginal probability expression 
above as the presence of a cluster of three vertices increases the probability of other clusters which 
involve these vertices. The nature of the construction, e.g. description of the functor j3 o a, leads to 
overlapping of the various subsets of A which is "forgotten" in the projection onto Q n and provides 
various ways by which clustering can occur. 



The construction also allows us to move between the space of graphs and that of monotone sets, which 
is helpful in calculations. 



3.1 Detecting clusters 



The nature of this construction naturally lends itself to methods in cluster analysis, which has been 
studied in certain applications in statistics and machine learning [21 [TOj [TT]. The setting is as follows. 
Let n > 1 be the size of a sample for which we label statistical units, e.g. individuals, arbitrarily in [n] 
and observe a network for this sample, i.e. an undirected graph G € Q n . Along with [n], let ~i, . . . , ~& 
be a collection of different equivalence relations on [n]. A collection of labels . . . ,i m } is said to 
form a cluster, or community, in our network if, for some / = 1, . . . , k, i p ~j i q for every p,q = 1, . . . , m. 
Inferring clusters in networks has implications, for example, in the problem of data deduplication and 
parsing for semi-structured text data sets as well as inferring communities and missing links in social 
networks. 

In a statistical setting for social networks could represent different 'types' of relationships among indi- 
viduals. That is, individuals in a social network are associated by certain relationships which underlie 
the network, e.g. a 4-node clique in a network could arise from 4 nodes belonging to the same cluster, or 
due to the overlap of two 3-node clusters and a 2-node cluster, e.g. {1, 2, 3}, {1, 2, 4} and {3,4}, which 
both result in the presence of the clique {1, 2, 3, 4} in the projected network. 

3.1.1 A statistical model 

In the setting of section [2] consider an infinite population U of units from which we sample a finite 
number n > 1 which we label in [n], i.e. our sample is u\, . . . ,u n , and we observe for this sample a 
network, or graph, G £ Q n which we assume to have been generated according to the Poisson point 
process (PPP) construction on 2^ which we laid out above. In particular, let A := (A ra (-),n > 1) be 
a family of mean measures on (2^,n > 1) such that A n (a) = \ n (#a) for every a and the A n (-) satisfy 
CO). Let fi n , v n and /i^ be the measures on £ n , J : n and Q n defined in above sections. Then we have the 
following model: 



We now imagine reversing this process to infer whether a given complete subgraph H C G of units 
{til, . . . , Uh} represents a cluster of U\, . . . , u n . This amounts to computing the conditional probability 
that X H > given that H C G. 

Suppose we observe a network G £ Q n with complete subgraph H. Under the inverse image of the 
functor j3, we have that /3 -1 (G) E T n is a collection of monotone sets which correspond to G. Fur- 
thermore, the inverse image of the least monotone cover a^ 1 [(3^ 1 (G)] is a collection of possible subsets 
of 2^ which have least monotone covers corresponding to fi~ 1 (G). As we have shown in theorem 
12. 1| the consistency condition in ([1]) guarantees infinite exchangeability of (// n ) as well as (/U n a _1 ) and 
(Unifiay 1 ) on J- rest and £ rest respectively. 

Hence (f5a)~ 1 (G) is the collection of random subsets in £ n which correspond to G. Define Z : = 
Z(H, G) :C (/3a) _1 (G) to be the elements of (/3a) _1 (G) which contain the set H, and Z the complement 
of Z in (/3a)^ 1 (G), i.e. Z := ((3a)~ 1 (G) — Z. Then the conditional probability that H is a cluster in 
the subsample of U given G is 



G\A 



X 



PPP(A) 
fj, . 



tt(H;G) := W(H a cluster) = fi n (Z)/n n [(f3ay 1 (G)}. 



(4) 



The conditional probability calculation in (0| is valid for inference of a cluster H if we are interested in 
the presence specifically of a cluster with exactly the elements of H and not a cluster H* D H which 
contains the entire cluster H. In applications, we might not be able to observe clusters at such a fine 
level. In particular, if there is a cluster H* 5 H in the population, we may not be able to observe 
the presence or absence of a sub-cluster H within the larger cluster. In this case, it is reasonable to 
consider performing inference at a coarser level. That is, for a given collection H := {u±, . . . ,Uh}, we 
wish to determine whether there is a cluster H* D H in the population. 

Given a graph with clique H, let H*(H,G) :C /3 _1 (G) be the monotone subset of j3~ l {G) such that 
H £ H* . In particular, 

H*(H, G) := [H, l n }n rHG)- 
Given G, the conditional probability of H*(H, G) is 

F(H*(H, G)\G) = v n [[H, l n ] n r\G)] /v n \p-\G)\. 
3.1.2 Stochastic classification 

Stochastic classification models have been studied previously by McCullagh and Yang |11) in the context 
of the Gauss-Ewens clustering process. In that model, a finite sample u\, . . . , u n is taken from an infinite 
population hi of units for which we observe some feature Yi := Y(m) £ S, for some subspace S, usually 
S C K. d . Associated to hi is a partition B of N and conditional on B the vector Y := (Yi, . . . ,Y n ) is 
normally distributed with mean and covariance which depend on B only through the restriction £?ir n i 
of B to [n\. For a newly sampled individual u*, it is shown how to classify u* based on its feature 
Y* := Y(u*) and the data (Y±, . . . ,Y n ) and Bu n -\ already obtained. This is carried out by computing 
the conditional distribution that u* belongs to each block b 6 Sir n i or a possibly new block of B given 
the data Y\, . . . , Y n , Y* and Bir n u The infinite exchangeability of the Gauss-Ewens process is a tool 
which allows the computation of conditional distributions in this setting. 

In our setting, we also assume an infinite population hi and an associated network (undirected graph) G 
of hi which is generated by the Poisson point process recipe above. Suppose we sample u\, . . . , u n from 
hi and we observe the restriction Griui of G to [n] as well as the component of £ n which corresponds to 
<jr|[ n ]. That is, we assume we observe the realization of X* , the projection of the Poisson point process 
on 2^ onto £ n . For a new individual u* € hi, suppose we observe its connections within the network, 
i.e. we have complete knowledge of t?i [n+l] > Du ^ nothing about its associated clusters in £ n +i- 

Infinite exchangeability of the measures ([J, n ,n > 1) on £" rest makes the specification of the conditional 
distribution of various outcomes straightforward. Given X* G £ n and G € G n +i, let H* € £ n +i be a 
collection which is consistent with X* and G, i.e. H* C (/3a) _1 (G) H tp^ n+1 (X*), we have 

r>v(H*\G X*) = Vn+i(H*) 

» n+1 [(Pa)-i(G)n^ +1 (X*)) 
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